Abstract



We consider a temporal version of the CHSH scenario using projective measurements on a 
single quantum system. It is known that quantum correlations in this scenario are fundamentally
more general than correlations obtainable with the assumptions of macroscopic realism

and non-invasive measurements. In this work, we also educe some fundamental limitations of 
these quantum correlations. One result is that a set of correlators can appear in the temporal 
CHSH scenario if and only if it can appear in the usual spatial CHSH scenario. In particular, 
we derive the validity of the Tsirelson bound and the impossibility of PR-box behavior. The 
strength of possible signaling also turns out to be surprisingly limited, giving a maximal com- 
munication capacity of approximately 0.32 bits. We also find a temporal version of Hardy's 
nonlocality paradox with a maximal quantum value of 1/4. 

1 Introduction

Quantum theory displays many counterintuitive features which are in stark contrast to our every- 
day experiences in the macroscopic world. Possibly the most extreme of these is the collapse of 
the wavefunction due to measurement; its contentious interpretation has given rise to the measurement
problem. Obviously, the only possibility to observe and study wavefunction collapse and its
entailments is to conduct measurements on the collapsed wavefunction. Therefore, in order to gain 
a better understanding of what the collapse means and how it occurs, one has to study repeated 
measurements on the same quantum system, both from a theoretical and from an experimental 
perspective. This should be seen as motivation for our work on temporal quantum correlations. 
In theories different from orthodox quantum mechanics, for example when wavefunction collapse is 
not absolutely instantaneous [Pea99], the properties of temporal correlations are likely to be different
from those presented here. 

Quantum correlations have mostly been investigated for scenarios of several spacelike separated 
parties sharing some nonlocal correlations. The simplest situation one can consider here is the 
Clauser-Horne-Shimony-Holt (CHSH) [CHSH69] scenario: two parties, commonly dubbed Alice
and Bob, each operate with a physical system of their own on which they respectively conduct one
of two dichotomic (i.e. two-valued) measurements. Then, on the one hand, quantum theory entails
phenomena that cannot be achieved classically: many quantum states that have the property of 



"This work grew out of joint discussions with Sibasish Ghosh and Tomasz Paterek. 



being entangled let Alice and Bob observe correlations between their measurements which cannot 
be explained by classical models defined in terms of local hidden variables; this non-classicality can 
be detected by observing violations of the CHSH inequalities. These inequalities precisely char- 
acterize those correlations having local hidden variable models. Furthermore, Hardy's nonlocality 
paradox [Har93] shows that this feature is not solely a quantitative trait of the joint outcome probabilities:
it proves that there also exists a qualitative difference between quantum correlations and
the realm of local hidden variable models. On the other hand, it has been found that there are
nevertheless strict limitations on which correlations can be observed with quantum-mechanical sys- 
tems. The Popescu-Rohrlich box (PR-box) is a joint probability distribution that is consistent with 
the causality principle of no-signaling, and yet such a PR-box cannot be constructed in a quantum-mechanical
world. This can be seen most directly from the Tsirelson bound, which specifies the
maximal quantum violation of the CHSH inequalities. 

In this paper, we study a temporal version of the CHSH scenario. We may imagine a single 
physical system in a laboratory, on which the two experimentalists Alice and Bob can conduct their 
measurements. However it so happens that their work shifts do not intersect, and Alice leaves the 
lab before Bob arrives. Now it is known that Alice, during her shift, has measured one of the two 
±1-valued observables $a_1$ or $a_2$, and likewise, Bob will measure one of the two ±1-valued observables
$b_1$ or $b_2$. It is crucial to assume that Alice only conducts one of the two projective measurements
$a_1$ and $a_2$, so that she cannot disturb the system and its natural dynamics in any other way. Then
which joint probability distributions for the measurement outcomes can possibly arise in this way? 
In the following sections we answer certain aspects of this question. Just like in the spatial case, 
we find both fundamental possibilities achievable by such quantum correlations, and fundamental 
limitations on these quantum correlations. There are analogues of all the spatial phenomena mentioned
in the previous paragraph: impossibility of hidden variable models (following [Lap06], no
locality or non-invasiveness assumption is actually needed), a version of Hardy's paradox which
turns out to be stronger than in the spatial scenario, the possibility of signaling in a limited form, 
impossibility of the PR-box, and the Tsirelson bound. Moreover, although the set of joint probabil- 
ities realizable by spatial quantum correlations is strictly contained in the set of joint probabilities 
realizable by temporal quantum correlations, we find that the set of realizable correlators is the 
same in the temporal case as in the spatial case. 

There has been a considerable amount of previous work on the properties of temporal quantum
correlations. In particular, the Leggett-Garg inequalities [LG85] characterize the probabilistic
hidden variable models for the scenario in which one measures two-time correlators between three ±1-valued
observables¹, and it is known that these can be violated quantum-mechanically. In contrast
to spacelike separated situations, it is not necessary here to have more than one observable for 
each "party" , i.e. at each point in time, since the observables between the different points in time 
need not commute, leading to specifically quantum phenomena. Very recently, Avis, Hayden and 
Wilde [AHW] have classified all tight Leggett-Garg inequalities for the two-time correlators between
any number of dichotomic observables as precisely the facets of the cut polytope. Some other
relevant references include [Lap06] and [BTCV].



1 In the standard scenario, these three observables are actually a single observable measured at three different 
times, but this assumption is not relevant to the argument. 



2 Joint probabilities in the temporal CHSH scenario 

We start with several statements about temporal correlations between projective quantum measurements
of ±1-valued observables. Then we describe the temporal CHSH scenario, which has
been outlined in the introduction, in a little more detail. 

2.1 Setting the stage. Consider a single quantum system with an underlying Hilbert space $\mathcal{H}$
and dynamics described by the Hamiltonian $H$. Furthermore, we have ±1-valued, i.e. dichotomic,
observables $a$ and $b$, which are hermitian operators on $\mathcal{H}$ with the property

$$a^2 = 1, \qquad b^2 = 1.$$

Note that we can bring any pair of two-valued observables into this form by relabelling the outcomes
as +1 and −1. Now Alice measures $a$ at time $t_A$ and Bob measures $b$ at time $t_B$. Both measurements
are assumed to be perfect projective von Neumann measurements, so that the state collapses to an
eigenstate of the corresponding observable upon the measurement. This assumption is relevant for
Alice since it limits the way in which her measurement $a$ can influence the system; we will see in
paragraph 2.3 that if we allowed arbitrary generalized measurements (Lüders measurements)
for Alice, then any set of joint outcome probabilities without signaling from Bob to Alice could be
modelled even with commuting Kraus operators, i.e. with a classical probabilistic system.

However for Bob, the assumption of projective measurements is not essential: since his post- 
measurement state does not get measured, this post-measurement state is irrelevant and only his 
outcome probabilities matter. And concerning these, we can always enlarge the Hilbert space to 
turn any POVM into a projective measurement while preserving the outcome probabilities. 

We take the system to be in the pure initial state $|\psi\rangle$ just before Alice's measurement at time
$t_A$. The assumption of a pure initial state is merely for notational convenience, and all following
calculations would also apply mutatis mutandis to the case of a mixed initial state. Note also that
in case of a mixed initial state described by a density operator $\rho$ on $\mathcal{H}$, we can replace it by a
purification $|\psi\rangle$ on $\mathcal{H} \otimes \mathcal{H}'$ for some $\mathcal{H}'$, while replacing the observables $a$ and $b$ by $a \otimes 1$ and $b \otimes 1$.
This retains all joint outcome probabilities.

When working in the Heisenberg picture, the unitary evolution of the state is trivial, while Bob's 
observables evolve according to 

$$b' = e^{-iH(t_B - t_A)}\, b\, e^{iH(t_B - t_A)}.$$

Since the observable $b$ was arbitrary, the evolved observable $b'$ is also just an arbitrary ±1-valued
observable on $\mathcal{H}$. Hence as far as the existence of quantum-mechanical models for joint probabilities
is concerned, the dynamics is irrelevant. In particular, we will choose $H = 0$ for simplicity, so that
$b' = b$. Then wavefunction collapse is the only "dynamics" present in our formalism.

2.2 Joint probabilities and correlators. Now we calculate the joint probabilities in terms
of $a$, $b$ and $|\psi\rangle$. For the ±1-valued observable $a$, the projection operator onto the +1-eigenspace
and the projection operator onto the −1-eigenspace are given by, respectively,

$$\frac{1+a}{2}, \qquad \frac{1-a}{2},$$



and in the same way for b. Using the Born rule together with the projection postulate shows that 
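the joint probability for Alice to get the outcome $r \in \{-1,+1\}$ and for Bob to get the outcome
$s \in \{-1,+1\}$ reads as

$$P(r,s) = \langle\psi|\,\frac{1+ra}{2}\cdot\frac{1+sb}{2}\cdot\frac{1+ra}{2}\,|\psi\rangle = \frac14 + \frac{r}{4}\langle\psi|a|\psi\rangle + \frac{s}{8}\langle\psi|b|\psi\rangle + \frac{rs}{8}\langle\psi|\{a,b\}|\psi\rangle + \frac{s}{8}\langle\psi|aba|\psi\rangle. \tag{1}$$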

In this expression, {•,•} denotes the anticommutator of two operators. P(r,s) is the probability 
that Alice observes the outcome r, multiplied by the probability that Bob gets the outcome s upon 
measuring the state of the system after state collapse due to Alice's outcome being r. 
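The closed form of the joint probability can be recovered by expanding the product of projectors. The short derivation below is a sketch of that step; it uses only $a^2 = 1$ and $r^2 = 1$:

```latex
\begin{align*}
(1+ra)(1+sb)(1+ra)
  &= (1+ra)^2 + s\,(1+ra)\,b\,(1+ra) \\
  &= 2(1+ra) + s\left(b + r\{a,b\} + aba\right), \\
P(r,s) = \tfrac18 \langle\psi|(1+ra)(1+sb)(1+ra)|\psi\rangle
  &= \tfrac14 + \tfrac{r}{4}\langle\psi|a|\psi\rangle + \tfrac{s}{8}\langle\psi|b|\psi\rangle
   + \tfrac{rs}{8}\langle\psi|\{a,b\}|\psi\rangle + \tfrac{s}{8}\langle\psi|aba|\psi\rangle .
\end{align*}
```

Only the $rs$ term survives a signed sum over both outcomes, so any such sum depends on $a$ and $b$ only through their anticommutator.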
We also consider correlators, which are defined as 

$$C = \sum_{r,s} rs\, P(r,s) = P(+1,+1) + P(-1,-1) - P(-1,+1) - P(+1,-1). \tag{2}$$

Using (1), the correlator can be expressed as

$$C = \frac12 \langle\psi|\{a,b\}|\psi\rangle, \tag{3}$$

which is intuitive since only the $rs$-term in equation (1) suggests any kind of correlation between
the outcomes. So strangely, even though our scenario has a clear temporal order, the correlators do
not depend on who measures first! As far as we can see, this curious property does not generalize
to observables with more than two outcomes or to scenarios with more than two parties.
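The order-independence of the correlator can be checked numerically. The following is a minimal sketch in plain Python, assuming a single qubit; the state and the two observables are arbitrary illustrative choices, and the helper names (`matmul`, `expect`, `projector`, `joint_probability`, `correlator`) are hypothetical, introduced only for this example:

```python
import math

# Minimal complex-matrix helpers (plain Python, no external libraries).
def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def expect(psi, A):
    """Return <psi|A|psi> for a normalised state vector psi."""
    n = len(psi)
    return sum(complex(psi[i]).conjugate() * A[i][j] * psi[j]
               for i in range(n) for j in range(n))

def projector(obs, outcome):
    """Projector (1 + outcome * obs) / 2 of a +/-1-valued observable."""
    n = len(obs)
    return [[((1 if i == j else 0) + outcome * obs[i][j]) / 2
             for j in range(n)] for i in range(n)]

def joint_probability(psi, first, second, r, s):
    """P(r, s): `first` is measured (outcome r), then `second` (outcome s)."""
    p_r, p_s = projector(first, r), projector(second, s)
    return expect(psi, matmul(p_r, matmul(p_s, p_r))).real

def correlator(psi, first, second):
    """Signed sum of joint probabilities over both outcomes."""
    return sum(r * s * joint_probability(psi, first, second, r, s)
               for r in (+1, -1) for s in (+1, -1))

# Illustrative (assumed) state and observables.
theta, phi = 0.7, 1.3
psi = [math.cos(theta / 2),
       complex(math.cos(phi), math.sin(phi)) * math.sin(theta / 2)]
h = 1 / math.sqrt(2)
a = [[1, 0], [0, -1]]      # sigma_z
b = [[h, h], [h, -h]]      # (sigma_x + sigma_z) / sqrt(2)

c_ab = correlator(psi, a, b)   # a measured first, then b
c_ba = correlator(psi, b, a)   # b measured first, then a
anticommutator = [[x + y for x, y in zip(row_ab, row_ba)]
                  for row_ab, row_ba in zip(matmul(a, b), matmul(b, a))]
half_anticomm = 0.5 * expect(psi, anticommutator).real

print(c_ab, c_ba, half_anticomm)
```

The three printed numbers agree up to rounding, although the individual joint probabilities generally change when the order of the measurements is swapped.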

Note that when we use the term "correlation" , we simply mean "specification of joint outcome 
probabilities for all allowed choices of observables" , while the notion of "correlator" refers only to 
the quantity (2).

2.3 The CHSH scenario. In the CHSH scenario, Alice and Bob both have an independent 
choice between two observables. While Alice can select either the observable $a_1$ or the observable
$a_2$, Bob has the freedom to measure either $b_1$ or $b_2$. For each of the resulting four choices, we obtain
a distribution of joint probabilities of the form (1). We will use the notation

$$P(r,s|k,l) \tag{4}$$

to denote the probability that Alice gets the outcome $r \in \{-1,+1\}$ and Bob gets the outcome
$s \in \{-1,+1\}$, given that Alice measures $a_k$ and Bob measures $b_l$. Finally, we will use the notation
$C_{kl}$ for the correlator between $a_k$ and $b_l$.

As announced in paragraph 2.1, it will now be proven that any set of probabilities (4) has a
quantum-mechanical representation in terms of generalized measurements (Lüders measurements)
for Alice, under the assumption that these probabilities satisfy causality in the sense that there is
no backward signaling from Bob to Alice. This intuitive condition means that the joint probabilities
can be factorized as

$$P(r,s|k,l) = P_B(s|r;k,l)\, P_A(r|k) \tag{5}$$

where $P_A(r|k)$ designates the outcome probabilities for Alice's measurement alone, and these are
assumed to be independent of Bob's data $l$ and $s$. On the other hand, Bob's conditional outcome
probabilities $P_B(s|r;k,l)$ may well depend on Alice's data in an arbitrary way. Condition (5) is
necessary for the existence of a representation via generalized measurements, since the product
representation (5) is essentially how one would typically calculate the joint probabilities starting
from the quantum-mechanical data: first determine Alice's outcome probabilities $P_A(r|k)$ given
the initial state $|\psi\rangle$, then calculate Bob's outcome probabilities $P_B(s|r;k,l)$, and finally multiply
these two probabilities to obtain the desired result. Bob's probabilities $P_B(s|r;k,l)$ depend on the
system's quantum state after Alice's measurement, and this state in turn is determined by $k$, $r$ and
$|\psi\rangle$.

Conversely, in order to find a quantum-mechanical representation for an arbitrary such set of
probabilities, consider a five-dimensional Hilbert space with orthonormal basis $\{|0\rangle, |1^+\rangle, |1^-\rangle, |2^+\rangle, |2^-\rangle\}$.
We take the initial state of the system to be $|\psi\rangle = |0\rangle$. There exist generalized measurements such
that the state after Alice's measurement is $|1^+\rangle$ if she measured $a_1$ and obtained a +1 outcome,
and it is $|1^-\rangle$ if she measured $a_1$ and obtained a −1 outcome, and similarly for $|2^+\rangle$ and $|2^-\rangle$.
Concretely, one can implement such measurements for example by using the Kraus operators

$$V_k^r = \sqrt{P_A(r|k)}\; |k^r\rangle\langle 0| + \frac{1}{\sqrt{2}} \sum_{k',r'} |k'^{\,r'}\rangle\langle k'^{\,r'}|, \qquad r \in \{-1,+1\},$$

as describing the measurement of $a_k$. The first term guarantees that the post-measurement state of
$V_k^r$ is the desired $|k^r\rangle$ and that the given measurement statistics are reproduced, both on the initial
state $|\psi\rangle = |0\rangle$. (The other terms are merely needed for satisfaction of the completeness relation
$\sum_r V_k^{r\dagger} V_k^r = 1$.) For Bob, we can choose the two POVMs $\{\Pi_1^{+1}, \Pi_1^{-1}\}$, $\{\Pi_2^{+1}, \Pi_2^{-1}\}$ with

$$\Pi_l^s = \mathrm{diag}\left(\tfrac12,\; P_B(s|{+1};1,l),\; P_B(s|{-1};1,l),\; P_B(s|{+1};2,l),\; P_B(s|{-1};2,l)\right)$$

as representing the measurements $b_1$ and $b_2$; since Bob's post-measurement state does not get observed,
we do not have to specify any Kraus operators implementing these POVMs. By construction,
these POVMs reproduce the desired outcome probabilities $P_B(s|r;k,l)$ on the corresponding states
$|k^r\rangle \in \{|1^+\rangle, |1^-\rangle, |2^+\rangle, |2^-\rangle\}$. This ends the construction of a quantum-mechanical model with
generalized measurements for (5). Some final remarks: since neither the initial state nor any post-measurement
state is a superposition of basis states, this construction effectively yields a classical
stochastic system. The trick in the construction is that Alice's post-measurement state keeps track
of both her measurement setting and her outcome. This conditional state collapse to mutually
orthogonal states would not be possible if we only allowed projective measurements for Alice.

2.4 Temporal hidden variable models. Using the assumption of what they called "macroscopic
realism" and "non-invasiveness", Leggett and Garg [LG85] derived an inequality satisfied
by temporal correlations in hidden variable models which is violated by certain temporal quantum 
correlations. Macroscopic realism is the assumption that the system is, at each instant in time, 
definitely situated in one of several distinct states. This system state determines all measurement 
outcomes exactly; in this sense, all observables possess preexisting definite values. This is thought 
to apply to macroscopic objects in particular, hence the name "macroscopic realism" , or more 
succinctly "macrorealism" . 

The crucial assumption now is non-invasiveness: this postulates that a measurement does not 
disturb the state of the system. There is an additional hidden assumption which has been made 
explicit and dubbed "induction" by Leggett [Leg08]: it is understood that the state of the system
at time t is sufficient information to calculate the outcomes of all future measurements. (In other 



words, causality only propagates forward in time.) All of these assumptions seem rather natural 
when dealing with macroscopic systems. In a manner analogous to the spatial case, one can now
use these premises to derive (see [Leg08], compare [BTCV]) the temporal CHSH inequality:

$$S_{\mathrm{CHSH}} = C_{11} + C_{12} + C_{21} - C_{22} \le 2. \tag{6}$$

On the other hand, it is known that this inequality can be violated by certain quantum correlations
[BTCV]. This is an exciting area due to promising prospects of using such results for testing
the applicability of quantum theory in the macroscopic domain [PLMN+10].
We will get back to hidden variable models in section |U 

2.5 Comparison to the spatial scenario. In general, the non-invasiveness assumption for 
hidden variable models is the exact analogue of locality in the spatial case. In both cases, the dis- 
tribution of joint measurement outcomes is a probabilistic combination (i.e. a convex combination) 
of a collection of realistic models; a realistic model in turn is described by a hidden variable A, 
constant over space and time, which determines all the outcomes of all possible measurements in a 
definite way. Therefore, there is absolutely no difference between local hidden variable models in 
spatial scenarios, and non-invasive hidden variable models in temporal scenarios. 

So the reason that one considers inequalities characterizing hidden variable models for temporal 
scenarios which are different from those in the spatial case is not that the hidden variable models 
are different; they are the same. The reason is that the quantum-mechanical correlations are very
different and strongly depend on whether one considers a spatial scenario or a temporal scenario. 
Although the Leggett-Garg inequality is perfectly valid as a spatial Bell inequality in a three-party 
scenario, it is not interesting in this case: since there is only one observable per party, no quantum 
violations are possible, and likewise no violations by more general no-signaling theories. 

Let us also note that any set of joint outcome probabilities for a spatial Bell test can also appear 
in the temporal scenario. Mathematically, this follows from the fact that we recover exactly the 
spatial joint probabilities by taking $a$ and $b$ in (1) to operate on separate tensor factors. Physically,
this is clear since we can just think of Alice's and Bob's spatially separated quantum systems as a 
single quantum system, and then simply imagine that Alice conducts her measurement first, with 
Bob's measurement operating at a later time. 

To end the comparison with spatial scenarios, let us recast (3) in the following form:

Proposition 2.1. While a spatial correlator is given by the expectation value of the tensor products 
of the observables, a temporal correlator is given by half the expectation value of the anticommutator 
of the observables: 

$$\text{spatial: } C = \langle\psi|a \otimes b|\psi\rangle \quad\longrightarrow\quad \text{temporal: } C = \tfrac12 \langle\psi|\{a,b\}|\psi\rangle$$
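As a consistency check, the temporal formula reduces to the spatial one when the two observables act on different tensor factors. The sketch below writes $a = A \otimes 1$ and $b = 1 \otimes B$, where $A$ and $B$ are labels introduced here only for illustration:

```latex
\begin{align*}
\{a, b\} &= (A \otimes 1)(1 \otimes B) + (1 \otimes B)(A \otimes 1) = 2\, A \otimes B, \\
\tfrac12 \langle\psi|\{a,b\}|\psi\rangle &= \langle\psi| A \otimes B |\psi\rangle .
\end{align*}
```

So for commuting observables the anticommutator expression is just the usual product expectation value, and the difference between the two scenarios lies entirely in the non-commuting case.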

2.6 The qubit case and beyond. As a first example of temporal quantum correlations, we 
consider a single qubit in the Bloch sphere picture. This case has also been treated in [BTCV].

Let the system have an initial state given in terms of the Bloch vector $\vec v$. A dichotomic observable
is described by a unit vector $\vec a \in \mathbb{R}^3$, such that the probability for getting the outcome $r \in \{-1,+1\}$
on the state $\vec v$ is given by

$$\tfrac12 (1 + r\, \vec a \cdot \vec v). \tag{7}$$

And in case that the outcome $r$ has been observed, the state has collapsed to $r\vec a$.
The dynamics of the qubit between $t_1$ and $t_2$ in this representation is specified by a rotation
matrix $R \in SO(3)$, such that the state prior to Bob's measurement is $R(r\vec a) = r R(\vec a)$. Then given
that Alice obtained the outcome $r$, the probability for Bob to get the outcome $s$ is consequently

$$\tfrac12 (1 + rs\, \vec b \cdot R(\vec a)). \tag{8}$$

After multiplying the two expressions (7) and (8) to get the joint probability and summing over $r$
and $s$ with the appropriate sign, the correlator explicitly reads according to the definition (2)

$$C = \vec b \cdot R(\vec a). \tag{9}$$



So remarkably, this correlator does not depend on the initial state, which is due to the collapse after 
Alice has measured, and the structure of the correlator as a particular linear combination of joint 
probabilities. This correlator is very similar to the correlator known from maximally entangled 
two-qubit states and therefore we can now find the maximal qubit value using simple techniques. 
The CHSH quantity then reads: 

$$S^{\mathrm{qubit}}_{\mathrm{CHSH}} = C_{11} + C_{12} + C_{21} - C_{22} = R(\vec a_1) \cdot (\vec b_1 + \vec b_2) + R(\vec a_2) \cdot (\vec b_1 - \vec b_2).$$

For finding its maximum, note that since the vectors $\vec b_i$ are normalized, the vectors in the brackets
are orthogonal. Moreover, $|\vec b_1 + \vec b_2|^2 + |\vec b_1 - \vec b_2|^2 = 4$ and so we can introduce two new orthogonal
normalized vectors $\vec b_+$ and $\vec b_-$ such that $\vec b_1 + \vec b_2 = 2\cos\alpha\, \vec b_+$ and $\vec b_1 - \vec b_2 = 2\sin\alpha\, \vec b_-$ for some angle $\alpha$.
Plugging this into the expression for $S^{\mathrm{qubit}}_{\mathrm{CHSH}}$ and optimizing over the $R(\vec a_i)$, which are also normalized
vectors, yields the Tsirelson bound of $2\sqrt2$, which is therefore the maximal value achievable with
a qubit. In particular, this violates the bound (6), confirming that quantum theory cannot be
equivalent to a probabilistic hidden variable theory with preexisting values for all observables and
repeatable measurements.
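The qubit calculation can be reproduced numerically from the Bloch-vector probabilities. The sketch below assumes trivial dynamics (the rotation $R$ is the identity) and one particular choice of measurement directions that saturates the bound; these directions and the function names are illustrative assumptions, not taken from the text:

```python
import math

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def joint_probability(v, a, b, r, s):
    """Alice's outcome probability times Bob's conditional probability.

    v is the initial Bloch vector, a and b are unit vectors; the dynamics
    between the two measurements is assumed to be trivial (R = identity).
    """
    p_alice = 0.5 * (1 + r * dot(a, v))
    p_bob_given_alice = 0.5 * (1 + r * s * dot(b, a))
    return p_alice * p_bob_given_alice

def correlator(v, a, b):
    return sum(r * s * joint_probability(v, a, b, r, s)
               for r in (+1, -1) for s in (+1, -1))

def chsh(v, a1, a2, b1, b2):
    return (correlator(v, a1, b1) + correlator(v, a1, b2)
            + correlator(v, a2, b1) - correlator(v, a2, b2))

h = 1 / math.sqrt(2)
a1, a2 = (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)   # Alice: x and y directions
b1, b2 = (h, h, 0.0), (h, -h, 0.0)          # Bob: the two diagonals

# The value does not depend on the initial Bloch vector v.
values = [chsh(v, a1, a2, b1, b2)
          for v in [(0.0, 0.0, 1.0), (0.3, -0.5, 0.2), (0.0, 0.0, 0.0)]]
print(values, 2 * math.sqrt(2))
```

Each entry of `values` equals $2\sqrt2$ up to rounding, illustrating both the independence of the initial state and the violation of the classical bound of 2.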

All the concrete examples of temporal quantum correlations which we will consider in the fol- 
lowing sections are modelled on qubits. So here let us quickly demonstrate that not all quantum 
correlations in the temporal CHSH scenario can arise from qubit data. Consider a qutrit system 
with orthonormal basis {|0), |1), |2)}, and the following prescriptions: 

• the initial state is $|\psi\rangle = |0\rangle$,
• $a_1$ measures if the system is in the state $|0\rangle + |1\rangle$,
• $a_2$ measures if the system is in the state $|0\rangle + |2\rangle$,
• $b_1$ measures if the system is in the state $|2\rangle$,
• $b_2$ is any dichotomic observable.
This system has the following properties: Alice's outcomes both have probability 1/2, independent
of whether she chooses $a_1$ or $a_2$. But her choice drastically affects Bob's prospects upon measuring
$b_1$: when Alice chooses $a_1$, he will definitely observe a −1 outcome; however when Alice chooses
$a_2$, his outcome will be uniformly random and independent of hers. Such behavior is impossible
in a qubit system: one would necessarily need to have $b_1 = -1$, otherwise Bob's outcome could
not be definite after Alice's non-trivial measurement of $a_1$. But then obviously his outcome would
also have to be a definite −1 when Alice measured $a_2$, which it is not allowed to be. It would be
interesting to try and turn this into a dimension witness in the sense of [BPA+08].
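The stated properties of the qutrit example can be verified by a direct calculation. The following sketch assumes the convention that each observable yields +1 when the system is found in the indicated (normalised) state and −1 otherwise; the helper names are hypothetical:

```python
import math

DIM = 3

def apply(M, v):
    return [sum(M[i][j] * v[j] for j in range(DIM)) for i in range(DIM)]

def norm_squared(v):
    return sum(x * x for x in v)

def projectors(u):
    """Return {+1: |u><u|, -1: 1 - |u><u|} for a normalised real vector u."""
    P = [[x * y for y in u] for x in u]
    Q = [[(1.0 if i == j else 0.0) - P[i][j] for j in range(DIM)]
         for i in range(DIM)]
    return {+1: P, -1: Q}

def joint(psi, alice, bob):
    """P(r, s) for Alice's projective measurement followed by Bob's."""
    table = {}
    for r, P_r in alice.items():
        collapsed = apply(P_r, psi)          # unnormalised post-measurement state
        for s, P_s in bob.items():
            table[(r, s)] = norm_squared(apply(P_s, collapsed))
    return table

h = 1 / math.sqrt(2)
psi = [1.0, 0.0, 0.0]                 # initial state |0>
a1 = projectors([h, h, 0.0])          # tests for (|0> + |1>) / sqrt(2)
a2 = projectors([h, 0.0, h])          # tests for (|0> + |2>) / sqrt(2)
b1 = projectors([0.0, 0.0, 1.0])      # tests for |2>

after_a1 = joint(psi, a1, b1)
after_a2 = joint(psi, a2, b1)
print(after_a1)
print(after_a2)
```

After $a_1$, all probability sits on Bob's −1 outcome, with Alice's two outcomes equally likely; after $a_2$, all four joint outcomes are equally likely.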

3 Correlator space and the Tsirelson bound 

We may ask whether the temporal correlators satisfy the Tsirelson bound generally, or whether this 
just holds for the case of a qubit system. From the qubit case we know that the Tsirelson bound 
can be attained; but a priori, some temporal quantum correlations may in principle be so strong 
that even the Tsirelson inequality 

$$S_{\mathrm{CHSH}} = C_{11} + C_{12} + C_{21} - C_{22} \le 2\sqrt2 \tag{10}$$

is violated. 

What we mean here by correlator space is the set of quadruples 

$$(C_{11}, C_{12}, C_{21}, C_{22})$$

which can appear as correlators between Alice's and Bob's measurements in a quantum-mechanical 
world. Recall that the correlators are defined as 

$$C_{kl} = \sum_{r,s \in \{-1,+1\}} rs\, P(r,s|k,l) \tag{11}$$

so that there is a linear map from probability space down to correlator space. Obviously, taking 
the projection of a point from probability space down to correlator space throws away some data, 
so specifying the four correlators is not sufficient for knowing the full set of joint probabilities. Yet 
the correlators contain precious information about the system, for example the maximal violation 
of the CHSH inequality, and they are also related to the possibility of producing PR-box behavior 
(see section 4).

For the remaining part of this section, we will consider the scenario in which Alice has a choice 
between $m \in \mathbb{N}$ dichotomic observables, while Bob has a choice between $n \in \mathbb{N}$ dichotomic observables.
Even in this generality, it is not hard to use the techniques of Tsirelson for showing that, in
correlator space, the temporal quantum region coincides with the spatial quantum region. Tsirelson
has proven in his paper [Tsi85] that the following three statements are equivalent, for any given
matrix of correlators $(C_{kl})_{k=1,\ldots,m}^{l=1,\ldots,n}$:

(a) There exists a C*-algebra $A$ with identity, hermitian elements $a_1, \ldots, a_m, b_1, \ldots, b_n$ and a
state $f$ on $A$ such that for any $k$, $l$, we have

$$a_k b_l = b_l a_k,$$

$$-1 \le a_k \le 1; \qquad -1 \le b_l \le 1,$$

$$f(a_k b_l) = C_{kl}.$$

(b) There exist Hilbert spaces $\mathcal{H}_a$ and $\mathcal{H}_b$ together with Hermitian operators $a_1, \ldots, a_m \in B(\mathcal{H}_a)$,
$b_1, \ldots, b_n \in B(\mathcal{H}_b)$ and a density matrix $\rho$ on $\mathcal{H}_a \otimes \mathcal{H}_b$ such that

$$a_k^2 = 1; \qquad b_l^2 = 1,$$

$$\mathrm{tr}\left(\rho\,(a_k \otimes b_l)\right) = C_{kl}.$$

(c) In the Euclidean space of dimension $\min(m,n)$, there exist vectors $x_1, \ldots, x_m, y_1, \ldots, y_n$ such
that

$$|x_k| \le 1; \qquad |y_l| \le 1; \qquad \langle x_k, y_l \rangle = C_{kl} \quad \forall k,l.$$

Proposition 3.1. These conditions are also equivalent to the following two: 

(a') There exists a C*-algebra $A$ with identity, hermitian elements $a_1, \ldots, a_m, b_1, \ldots, b_n$ and a
state $f$ on $A$ such that for any $k$, $l$ we have

$$-1 \le a_k \le 1; \qquad -1 \le b_l \le 1,$$

$$f\left(\tfrac12\{a_k, b_l\}\right) = C_{kl}.$$

(b') There exists a Hilbert space $\mathcal{H}$ together with Hermitian operators $a_1, \ldots, a_m, b_1, \ldots, b_n \in B(\mathcal{H})$
and a density matrix $\rho$ on $\mathcal{H}$ such that

$$a_k^2 = 1; \qquad b_l^2 = 1,$$

$$\mathrm{tr}\left(\rho \cdot \tfrac12\{a_k, b_l\}\right) = C_{kl}.$$

Proof. We first show that (b)⇒(b'). Given the data as in (b), it is clear that they also satisfy (b')
if we take $\mathcal{H} = \mathcal{H}_a \otimes \mathcal{H}_b$ and rename $a_k \otimes 1$ to $a_k$ and $1 \otimes b_l$ to $b_l$.

The implication (b')⇒(a') easily follows by choosing $A = B(\mathcal{H})$, and $f(x) = \mathrm{tr}(\rho \cdot x)$.

To close the circle of implications, we will now check that (a')⇒(c). But this works in exactly the
same way as Tsirelson's own proof [Tsi85] that (a)⇒(c): start with the finite-dimensional vector
space defined to be the $\mathbb{R}$-linear span of the $a_k$ and the $b_l$. This vector space carries an inner
product, possibly degenerate, which is defined as

$$\langle x, y \rangle = f\left(\tfrac12\{x, y^*\}\right) = \mathrm{Re}\, f(y^* x).$$

After quotienting out the null space, this inner product becomes positive definite and produces a
Euclidean space such that

$$|a_k|^2 = \langle a_k, a_k \rangle = f(a_k^2) \le 1, \qquad |b_l|^2 = \langle b_l, b_l \rangle = f(b_l^2) \le 1, \qquad \langle a_k, b_l \rangle = C_{kl}$$

as required. Now just as in [Tsi85], all the requirements of (c) are satisfied, except that the
dimension of the space has to be at most $\min(m,n)$. This can also be easily achieved by orthogonal
projection of the vectors $x_k = a_k$ onto the subspace spanned by the vectors $y_l = b_l$, or the other
way around. □

By (3), we have therefore proven the following result:

Theorem 3.2. A matrix of correlators $(C_{kl})_{k=1,\ldots,m}^{l=1,\ldots,n}$ can appear as temporal correlations between
dichotomic projective measurements on the same system if and only if it can appear as spatial
correlations between dichotomic measurements on two spatially separated entangled systems.

In particular, this implies that the Tsirelson bound (10) is indeed generally valid in our temporal
setting.
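For the CHSH case $m = n = 2$, the Tsirelson bound can be read off directly from condition (c) of Tsirelson's characterization. The following is a sketch of that step, using only the Cauchy-Schwarz inequality, the elementary estimate $u + v \le \sqrt{2(u^2 + v^2)}$ and the parallelogram identity:

```latex
\begin{align*}
S_{\mathrm{CHSH}} &= \langle x_1, y_1 + y_2 \rangle + \langle x_2, y_1 - y_2 \rangle
   \le |y_1 + y_2| + |y_1 - y_2| \\
  &\le \sqrt{2\left(|y_1 + y_2|^2 + |y_1 - y_2|^2\right)}
   = 2\sqrt{|y_1|^2 + |y_2|^2} \le 2\sqrt{2}.
\end{align*}
```

Since the vector representation exists for temporal and spatial correlators alike, the same chain of inequalities applies in both scenarios.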




4 Impossibility of PR-box correlations 

We say that a PR-box correlation is a set of joint probabilities $P(r,s|k,l)$ which has the property
that the outcomes $r$ and $s$ are equal if and only if $k = l = 2$. This property is equivalent to the
requirement that the four correlators (3) are given by

$$C_{11} = C_{12} = C_{21} = -1, \qquad C_{22} = +1. \tag{12}$$

Correlations of this form could be used e.g. to achieve optimal better-than-quantum performance in
two-party XOR games (see e.g. [CHTW04]). When the joint probabilities $P(r,s|k,l)$ are assumed
to be no-signaling, then this requirement actually fixes all values for the probabilities uniquely;
however this does not apply here as our temporal scenario allows signaling from Alice to Bob.
Starting from (3), we now determine when a correlator $C_{kl}$ can have a value of ±1,

$$C_{kl} = \tfrac12 \langle\psi|\{a_k, b_l\}|\psi\rangle = \pm 1,$$

which is equivalent to 

$$\langle\psi|a_k b_l|\psi\rangle + \langle\psi|b_l a_k|\psi\rangle = \pm 2.$$

But now since the absolute value of each term is ≤ 1, and becomes 1 if and only if $|\psi\rangle$ is an eigenstate
of the respective operator, it follows that PR-box behavior requires $|\psi\rangle$ to be an eigenstate of the
following form:

$$b_l a_k |\psi\rangle = a_k b_l |\psi\rangle = -(-1)^{(k-1)(l-1)} |\psi\rangle.$$

But these equations imply

$$\langle\psi|\psi\rangle = \langle\psi|a_1 b_1 b_1 a_2|\psi\rangle = \langle\psi|a_1 a_2|\psi\rangle = \langle\psi|a_1 b_2 b_2 a_2|\psi\rangle = -\langle\psi|\psi\rangle,$$

which is impossible for any $|\psi\rangle \ne 0$. Therefore, PR-box behavior is impossible even for the temporal
quantum correlations which we consider here. We could also have concluded this from theorem 3.2.

5 Strength of signaling 

In our Bell-test scenario, the backward no-signaling equations 

$$P(r,-1|k,1) + P(r,+1|k,1) = P(r,-1|k,2) + P(r,+1|k,2) \quad \forall r,k$$

are still true: the marginal probability governing Alice's measurement cannot possibly depend on 
the measurement setting of Bob. However the forward no-signaling equations 

$$P(-1,s|1,l) + P(+1,s|1,l) = P(-1,s|2,l) + P(+1,s|2,l) \quad \forall s,l \tag{13}$$

are typically violated, since the choice of measurement for Alice influences the system state after her 
measurement, and therefore changes the outcome probabilities for Bob. Effectively, what Bob sees 
is not exactly the initial state $|\psi\rangle$, but $|\psi\rangle$ after undergoing decoherence due to Alice's measurement.
It is an interesting question to ask how much the no-signaling equations (13) can be violated by
our quantum-mechanical setup. This is why we want to look at the deviations from (13) and
determine how large they can possibly be in quantum theory. Since each of these four possible
quantities involves only one fixed measurement setting $l$ of Bob, we will disregard Bob's choice for
the rest of this section, and assume that he simply measures any dichotomic observable $b$. The joint
probabilities we then consider are of the form $P(r,s|k)$. Then the two signaling quantities are

$$S_+ = P(+1,+1|1) + P(-1,+1|1) - P(+1,+1|2) - P(-1,+1|2),$$
$$S_- = P(+1,-1|1) + P(-1,-1|1) - P(+1,-1|2) - P(-1,-1|2).$$

Due to the total outcome probability for each choice of measurement being 1, it necessarily holds 
that $S_+ + S_- = 0$, independent of whether the system is quantum or not. Therefore the interesting
question now is, which values of S+ are achievable by quantum mechanics? This is what we are 
going to answer here. 

A priori, $S_+$ can be expected to attain all the values in the interval $[-1,+1]$. The extreme
values of —1 and +1 correspond to perfect signaling in the sense that Bob can definitely tell which 
measurement Alice had chosen. This can be interpreted as a classical communication channel with 
a capacity of 1 bit. 

Theorem 5.1. A signaling level $S_+ \in [-1,+1]$ is quantum-mechanical if and only if $|S_+| \le \tfrac12$.

Proof. By (1), the signaling quantity $S_+$ can be expressed in terms of the observables and the initial
state as

$$S_+ = \tfrac14 \langle\psi|(a_1 b a_1 - a_2 b a_2)|\psi\rangle, \tag{15}$$

where most terms have in fact dropped out. This equation implies

$$|S_+| \le \tfrac14 |\langle\psi|a_1 b a_1|\psi\rangle| + \tfrac14 |\langle\psi|a_2 b a_2|\psi\rangle|.$$

Each term within the absolute value brackets in turn can be bounded by 1, since it is the expectation
value of a ±1-valued observable, so that the bound $|S_+| \le 1/2$ follows.

Conversely, since the set of allowed values for $S_+$ needs to be convex, it is sufficient to show that the
values +1/2 and −1/2 can be attained. For attaining the value +1/2, we can choose

$$|\psi\rangle = |x+\rangle, \qquad a_1 = \sigma_x, \qquad a_2 = \sigma_y, \qquad b = \sigma_x, \tag{16}$$

where a direct calculation shows that this indeed has the required property. □ 
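The direct calculation can be reproduced with a few lines of plain Python. This is a minimal sketch that simulates the two sequential projective measurements for the qubit protocol, taking $|x+\rangle$ to be the +1 eigenstate of $\sigma_x$; the helper names are hypothetical and introduced only for this example:

```python
import math

def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def expect(psi, A):
    """Return <psi|A|psi> for a normalised state vector psi."""
    n = len(psi)
    return sum(complex(psi[i]).conjugate() * A[i][j] * psi[j]
               for i in range(n) for j in range(n))

def projector(obs, outcome):
    """Projector (1 + outcome * obs) / 2 of a +/-1-valued observable."""
    n = len(obs)
    return [[((1 if i == j else 0) + outcome * obs[i][j]) / 2
             for j in range(n)] for i in range(n)]

def joint_probability(psi, a, b, r, s):
    """P(r, s | setting): Alice measures a (outcome r), then Bob measures b."""
    p_r, p_s = projector(a, r), projector(b, s)
    return expect(psi, matmul(p_r, matmul(p_s, p_r))).real

def bob_marginal(psi, a, b, s):
    """Bob's probability for outcome s, summed over Alice's outcomes."""
    return sum(joint_probability(psi, a, b, r, s) for r in (+1, -1))

sigma_x = [[0, 1], [1, 0]]
sigma_y = [[0, -1j], [1j, 0]]
h = 1 / math.sqrt(2)
psi = [h, h]                       # |x+>, the +1 eigenstate of sigma_x

p_plus_1 = bob_marginal(psi, sigma_x, sigma_x, +1)   # Alice chose a_1 = sigma_x
p_plus_2 = bob_marginal(psi, sigma_y, sigma_x, +1)   # Alice chose a_2 = sigma_y
s_plus = p_plus_1 - p_plus_2

print(p_plus_1, p_plus_2, s_plus)
```

With Alice measuring $\sigma_x$ the state is undisturbed and Bob's outcome is +1 with certainty; with $\sigma_y$ his outcome becomes uniformly random, so the difference of the two marginals is 1/2.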

As was already mentioned briefly, we may also consider the signaling strength in terms of the 
information which Bob's measurement outcome contains about Alice's choice of setting. This is 
encoded in the two probabilities 

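$$
\begin{aligned}
P(s|k) &= P(-1,s|k) + P(+1,s|k) \\
&= \frac{1}{2} + \frac{1}{4}\, s\, \langle\psi|b|\psi\rangle + \frac{1}{4}\, s\, \langle\psi|a_k b a_k|\psi\rangle
\end{aligned}
$$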

which define a classical communication channel from the input alphabet $k \in \{1,2\}$ to the output
alphabet $s \in \{-1,+1\}$. Since Bob's outcome is only dichotomic, we can equivalently consider the
expectation value of his measurement,

$$E(s|k) = P(+1|k) - P(-1|k), \qquad k \in \{1,2\},$$

and the question then is, which pairs $(E(s|1), E(s|2))$ can occur quantum-mechanically, and how
does this bound the classical capacity with which Alice can use her measurements in order to send
information to Bob? The answer to this question is given in the following theorem:

Figure 1: Possible pairs of $E(s|1)$ and $E(s|2)$ as they can appear in quantum theory (axes: $E(s|1)$
and $E(s|2)$). The square-shaped box is the whole region of values allowed in principle,
$-1 \leq E(s|1), E(s|2) \leq 1$.



Theorem 5.2. A pair $(E(s|1), E(s|2))$ can occur quantum-mechanically if and only if

$$|E(s|1) - E(s|2)| \leq 1.$$

The maximal communication capacity is $\log_2(5/4) \approx 0.32$ bits, which can be achieved using the
qubit protocol (16).

This result is illustrated in figure 1.



Proof. Since $E(s|1) - E(s|2) = 2S_+$, the constraint $|E(s|1) - E(s|2)| \leq 1$ immediately follows
from theorem 5.1. On the other hand, the qubit protocol (16) achieves $E(s|1) = 1$, $E(s|2) = 0$, which is
one of the four non-trivial vertices of the convex polygon shown in figure 1. The other three vertices can be
attained by the same protocol after possibly switching $s \leftrightarrow -s$ and $a_1 \leftrightarrow a_2$. The two remaining
vertices $(+1,+1)$ and $(-1,-1)$ involve no signaling and are attained trivially, for example with
$a_1 = a_2 = b$ and $|\psi\rangle$ an eigenstate of $b$. Now since the quantum
region has to be convex, and the polygon is the smallest convex set containing its vertices, it
follows that $|E(s|1) - E(s|2)| \leq 1$ is also sufficient for the existence of a quantum-mechanical model.

Now we get to the capacity statement. Since classical communication capacity is a convex
function of the transition probabilities, we know that the maximal capacity is attained at the
polygon's vertices. Since the four non-trivial vertices are all simple permutations of the protocol (16), the
corresponding channels have equal capacity, and it is sufficient to calculate the capacity achievable
by the data (16). A direct calculation shows that the optimal input distribution is a relative
frequency of 3/5 for $a_1$ and 2/5 for $a_2$, resulting in a mutual information of $\log_2(5/4) \approx 0.32$
bits. □
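The capacity calculation at the end of the proof can be checked numerically. The sketch below assumes the channel of protocol (16), that is $P(+1|1) = 1$ and $P(+1|2) = 1/2$, and maximizes the mutual information over Alice's input frequency by a simple grid search. The function names and the grid resolution `steps` are illustrative choices, not part of the paper.

```python
import math

def h2(p):
    """Binary entropy in bits."""
    if p in (0.0, 1.0):
        return 0.0
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

def mutual_information(q1, p_plus_1, p_plus_2):
    """I(K;S) in bits for input distribution (q1, 1 - q1) and P(s=+1|k)."""
    p_plus = q1 * p_plus_1 + (1 - q1) * p_plus_2
    return h2(p_plus) - q1 * h2(p_plus_1) - (1 - q1) * h2(p_plus_2)

# Channel of protocol (16): E(s|1) = 1 and E(s|2) = 0.
p1, p2 = 1.0, 0.5

# Grid search over q1 = relative frequency with which Alice chooses a_1.
steps = 100000
best_q1 = max((i / steps for i in range(steps + 1)),
              key=lambda q: mutual_information(q, p1, p2))
capacity = mutual_information(best_q1, p1, p2)
print(best_q1, capacity, math.log2(5 / 4))
```

The maximizer found on the grid lies at (or next to) $q_1 = 3/5$, and the resulting mutual information agrees with $\log_2(5/4)$ to within the grid resolution.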






6 A temporal version of Hardy's nonlocality paradox 

Hardy's paradox [Mer94] occurs when the joint probabilities have the following properties:

$$
\begin{aligned}
P(+1,+1|1,1) &= 0, \\
P(-1,+1|1,2) &= 0, \\
P(+1,-1|2,1) &= 0, \\
P(+1,+1|2,2) &> 0.
\end{aligned} \tag{18}
$$

This is impossible in any realistic theory where Alice's measurements are non-invasive. We note 
that the only relevant information contained in hidden variables lies in the preexisting values of all 
relevant observables. Hence any (stochastic) hidden variable model is given by a statistical mixture 
of the 16 realistic states 

$$a_1^{\pm}\, a_2^{\pm}\, b_1^{\pm}\, b_2^{\pm} \tag{19}$$

where in this notation (from [Lap06]), each sign stands for the corresponding measurement outcome
it determines with certainty, and the four signs are independent of each other. By the assumption
$P(+1,+1|2,2) > 0$, we know that this statistical mixture contains at least one state of the form

$$a_1^{\pm}\, a_2^{+}\, b_1^{\pm}\, b_2^{+}.$$

But now due to $P(-1,+1|1,2) = 0$, this cannot be one of the two states $a_1^{-} a_2^{+} b_1^{\pm} b_2^{+}$. Likewise by
$P(+1,-1|2,1) = 0$, it cannot be one of the two states $a_1^{\pm} a_2^{+} b_1^{-} b_2^{+}$. Therefore, the statistical mixture
of realistic states necessarily contains the state

$$a_1^{+}\, a_2^{+}\, b_1^{+}\, b_2^{+}$$

but now this contradicts the assumption $P(+1,+1|1,1) = 0$! Therefore, the existence of joint
probabilities with the property (18) exhibits a rather strong form of contextuality. Note that this
kind of reasoning applies to a spatial as well as to a temporal Bell test scenario.
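The counting argument above can also be checked by brute force. The following sketch enumerates the 16 realistic states as sign tuples `(a1, a2, b1, b2)` and assumes, as in the text, that measurements are non-invasive, so that each state fixes Bob's outcome independently of Alice's setting. The helper name `contributes` is illustrative.

```python
from itertools import product

# A realistic state assigns a definite outcome (+1 or -1) to each of a1, a2, b1, b2.
states = list(product([+1, -1], repeat=4))  # tuples (a1, a2, b1, b2)

def contributes(state, r, s, k, l):
    """True if the deterministic state yields outcomes (r, s) for settings (k, l)."""
    a_outcome = state[k - 1]
    b_outcome = state[2 + l - 1]
    return a_outcome == r and b_outcome == s

# The three vanishing probabilities of Hardy's paradox, as (r, s, k, l).
zero_constraints = [(+1, +1, 1, 1), (-1, +1, 1, 2), (+1, -1, 2, 1)]

# States that may appear in the mixture: they contribute to no vanishing probability.
allowed = [st for st in states
           if not any(contributes(st, *c) for c in zero_constraints)]

# Allowed states that would make P(+1,+1|2,2) positive.
paradoxical = [st for st in allowed if contributes(st, +1, +1, 2, 2)]
print(len(states), len(allowed), paradoxical)
```

The list `paradoxical` comes out empty: no realistic state compatible with the three vanishing probabilities can contribute to $P(+1,+1|2,2)$, which is the contradiction derived above.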

In fact, (18) is indeed realizable in quantum theory, and it is known that the maximal value for
$P(+1,+1|2,2)$ in a spatial scenario is approximately 0.09 [Mer94]. Here we would like to determine
the maximal possible value of $P(+1,+1|2,2)$ in the temporal CHSH scenario. Again, since joint
probabilities for the temporal case comprise those of the spatial case, the maximal temporally
realizable value of $P(+1,+1|2,2)$ has to be at least 0.09. We will now proceed to show that one can
achieve a substantially higher value than this. This shows again that temporal quantum correlations
are often stronger than spatial quantum correlations.

Theorem 6.1. The maximal value for $P(+1,+1|2,2)$ in the temporal Hardy paradox is 1/4.

Proof. In order for a probability $P(r,s|k,l)$ as in (18) to vanish, one needs that

• either Alice's outcome $r$ by itself is already impossible to occur, i.e. the other outcome $-r$
occurs with certainty. This means that the initial state is a $(-r)$-eigenstate of $a_k$, $a_k|\psi\rangle = -r|\psi\rangle$.

• or Alice's post-measurement state $\frac{1 + r a_k}{2}|\psi\rangle$ (unnormalized) is such that Bob's outcome is
impossible, i.e. it is a $(-s)$-eigenstate of $b_l$,

$$b_l (1 + r a_k)|\psi\rangle = -s (1 + r a_k)|\psi\rangle.$$

Hence, vanishing joint probability is equivalent to

$$(1 + s b_l)(1 + r a_k)|\psi\rangle = 0 \tag{20}$$

which can be interpreted as a vanishing amplitude for the two outcomes to occur together. So the
vanishing constraints from (18) are equivalent to

$$(1 + b_1)(1 + a_1)|\psi\rangle = 0, \tag{21}$$

$$(1 + b_2)(1 - a_1)|\psi\rangle = 0, \tag{22}$$

$$(1 - b_1)(1 + a_2)|\psi\rangle = 0. \tag{23}$$

The qubit protocol

$$|\psi\rangle = |x+\rangle, \quad a_1 = -\sigma_x, \quad a_2 = \sigma_y, \quad b_1 = \sigma_y, \quad b_2 = -\sigma_x$$

does indeed satisfy all of these constraints, and it achieves a value of $P(+1,+1|2,2) = 1/4$ as
promised. The remaining part of the proof is dedicated to showing that this value is optimal.
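The claims about this qubit protocol can be verified with a short numerical sketch. It assumes the same projective measurement rule as in the rest of the paper, $P(r,s|k,l) = \langle\psi|\Pi^{a_k}_r \Pi^{b_l}_s \Pi^{a_k}_r|\psi\rangle$ with $\Pi^{a}_r = (1 + r a)/2$. All helper names are illustrative.

```python
# Pauli matrices and 2x2 helpers as nested lists (standard library only).
I2 = [[1, 0], [0, 1]]
SX = [[0, 1], [1, 0]]
SY = [[0, -1j], [1j, 0]]

def scaled(c, op):
    return [[c * x for x in row] for row in op]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def projector(op, sign):
    """Projector (1 + sign*op)/2 onto the sign-eigenspace of a +-1-valued observable."""
    return [[(I2[i][j] + sign * op[i][j]) / 2 for j in range(2)] for i in range(2)]

def expval(psi, op):
    """Real part of <psi|op|psi> for a normalized two-component state."""
    op_psi = [sum(op[i][j] * psi[j] for j in range(2)) for i in range(2)]
    return sum(complex(psi[i]).conjugate() * op_psi[i] for i in range(2)).real

def joint_prob(psi, a_k, r, b_l, s):
    """P(r, s | k, l) for Alice measuring a_k first and Bob measuring b_l afterwards."""
    pa, pb = projector(a_k, r), projector(b_l, s)
    return expval(psi, matmul(pa, matmul(pb, pa)))

psi = [2 ** -0.5, 2 ** -0.5]            # |x+>
a = {1: scaled(-1, SX), 2: SY}          # Alice: a_1 = -sigma_x, a_2 = sigma_y
b = {1: SY, 2: scaled(-1, SX)}          # Bob:   b_1 = sigma_y,  b_2 = -sigma_x

# The four probabilities appearing in Hardy's paradox, keyed by (r, s, k, l).
hardy = {(r, s, k, l): joint_prob(psi, a[k], r, b[l], s)
         for (r, s, k, l) in [(+1, +1, 1, 1), (-1, +1, 1, 2),
                              (+1, -1, 2, 1), (+1, +1, 2, 2)]}
for key, value in hardy.items():
    print(key, value)
```

Up to rounding, the first three probabilities vanish and the last one equals $1/4$.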

For $b_1$, the equations (21) and (23) mean that $(1 + a_1)|\psi\rangle$ has to lie in the $-1$-eigenspace of $b_1$
(since this vector has zero projection onto the $+1$-eigenspace), and similarly that $(1 + a_2)|\psi\rangle$ has to
lie in the $+1$-eigenspace of $b_1$. These eigenspaces are necessarily orthogonal since $b_1$ is hermitian.
Hence, given any initial state $|\psi\rangle$ together with $\pm 1$-valued observables $a_1$, $a_2$, $b_2$ which satisfy (22),
we can find an observable $b_1$ which also satisfies (21) and (23) if and only if

$$(1 + a_1)|\psi\rangle \perp (1 + a_2)|\psi\rangle, \quad \text{i.e.} \quad \langle\psi|(1 + a_1)(1 + a_2)|\psi\rangle = 0, \tag{24}$$

holds. So this condition is equivalent to (21) and (23) together and also comprises the case that
$|\psi\rangle$ is any eigenstate of $a_1$ or $a_2$.

Now imagine that we have $|\psi\rangle$, $a_1$ and $a_2$ such that (24) is satisfied. Then what can we choose
for $b_2$ in order to also satisfy (22)? Equation (22) means exactly that $(1 - a_1)|\psi\rangle$ is contained in
the $-1$-eigenspace of $b_2$. When $p$ stands for the projection operator onto the vector $(1 - a_1)|\psi\rangle$,
this means exactly that

$$\frac{1 - b_2}{2} \geq p.$$

(Here, the partial order "$\leq$" is the usual partial order on the set of hermitian operators; recall
that $x \leq y$ in this order is defined to mean that $y - x$ is positive semi-definite.) On the
other hand, we have

$$\frac{1 - a_1}{2}|\psi\rangle\langle\psi|\frac{1 - a_1}{2} \leq p$$

since the norm of $\frac{1 - a_1}{2}|\psi\rangle$ is at most 1. Hence, we can conclude from these two inequalities that

$$1 + b_2 \leq 2 - 2p \leq 2 - \frac{1}{2}(1 - a_1)|\psi\rangle\langle\psi|(1 - a_1).$$

When plugging this result into the expression for the "paradoxical" probability $P(+1,+1|2,2)$, we
obtain

$$
\begin{aligned}
P(+1,+1|2,2) &= \frac{1}{8}\langle\psi|(1 + a_2)(1 + b_2)(1 + a_2)|\psi\rangle \\
&\leq \frac{1}{2}\langle\psi|(1 + a_2)|\psi\rangle - \frac{1}{16}\langle\psi|(1 + a_2)(1 - a_1)|\psi\rangle\langle\psi|(1 - a_1)(1 + a_2)|\psi\rangle,
\end{aligned}
$$

where it has been used that $(1 + a_2)^2 = 2(1 + a_2)$. But now (24) can be applied to evaluate the
second term by using

$$
\begin{aligned}
\langle\psi|(1 + a_2)(1 - a_1)|\psi\rangle &= 2\langle\psi|(1 + a_2)|\psi\rangle - \langle\psi|(1 + a_2)(1 + a_1)|\psi\rangle \\
&\overset{(24)}{=} 2\langle\psi|(1 + a_2)|\psi\rangle.
\end{aligned}
$$

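Hence we finally end up with

$$P(+1,+1|2,2) \leq \frac{1}{2}\langle\psi|(1 + a_2)|\psi\rangle - \frac{1}{4}\langle\psi|(1 + a_2)|\psi\rangle^2$$

which is of the form $x - x^2$ for $x = \frac{1}{2}\langle\psi|(1 + a_2)|\psi\rangle$. The maximal value of this function is $\frac{1}{4}$, hence
the claim is proven. □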
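The last step can be made explicit by completing the square. The short derivation below is a worked restatement of the final bound; the remark on when equality can hold is an added observation that follows from the definition of $x$.

```latex
x - x^2 \;=\; \frac{1}{4} - \left(x - \frac{1}{2}\right)^2 \;\leq\; \frac{1}{4},
\qquad
x = \frac{1}{2}\langle\psi|(1 + a_2)|\psi\rangle \in [0,1],
\qquad
\text{with equality iff } x = \frac{1}{2}, \text{ i.e. } \langle\psi|a_2|\psi\rangle = 0 .
```

This matches the qubit protocol given earlier in the proof, for which $a_2 = \sigma_y$ and $\langle x+|\sigma_y|x+\rangle = 0$.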